Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 891221 |
| Missing cells | 161560 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 78443 |
| Duplicate rows (%) | 8.8% |
| Total size in memory | 115.6 MiB |
| Average record size in memory | 136.0 B |
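The percentages and the average record size in the table above follow directly from the raw counts. The snippet below is a minimal sketch that recomputes them; the variable names are illustrative and not part of the report, and it assumes that "MiB" means 1024 × 1024 bytes.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 17
n_observations = 891221
missing_cells = 161560
duplicate_rows = 78443
total_size_mib = 115.6

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

missing_pct = 100 * missing_cells / total_cells
duplicate_pct = 100 * duplicate_rows / n_observations

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"Missing cells (%):   {missing_pct:.1f}")
print(f"Duplicate rows (%):  {duplicate_pct:.1f}")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

The missing-cell percentage is taken over all cells (observations × variables), not over rows, which is why 161560 missing cells amount to only about 1.1%.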
Variable types
| Categorical | 10 |
|---|---|
| Numeric | 7 |
Dataset has 78443 (8.8%) duplicate rows | Duplicates
HEALTH_TYP is highly correlated with NATIONALITAET_KZ and 3 other fields | High correlation |
NATIONALITAET_KZ is highly correlated with HEALTH_TYP and 1 other fields | High correlation |
PRAEGENDE_JUGENDJAHRE is highly correlated with HEALTH_TYP | High correlation |
SEMIO_SOZ is highly correlated with ANREDE_KZ | High correlation |
SHOPPER_TYP is highly correlated with HEALTH_TYP and 1 other fields | High correlation |
VERS_TYP is highly correlated with HEALTH_TYP and 2 other fields | High correlation |
ANREDE_KZ is highly correlated with SEMIO_SOZ | High correlation |
GEBURTSJAHR is highly correlated with PRAEGENDE_JUGENDJAHRE | High correlation |
HEALTH_TYP is highly correlated with VERS_TYP | High correlation |
PRAEGENDE_JUGENDJAHRE is highly correlated with GEBURTSJAHR and 1 other fields | High correlation |
SEMIO_SOZ is highly correlated with ANREDE_KZ | High correlation |
VERS_TYP is highly correlated with HEALTH_TYP | High correlation |
ANREDE_KZ is highly correlated with SEMIO_SOZ | High correlation |
ALTERSKATEGORIE_GROB is highly correlated with PRAEGENDE_JUGENDJAHRE | High correlation |
GEBURTSJAHR is highly correlated with PRAEGENDE_JUGENDJAHRE | High correlation |
HEALTH_TYP is highly correlated with VERS_TYP | High correlation |
PRAEGENDE_JUGENDJAHRE is highly correlated with GEBURTSJAHR | High correlation |
SEMIO_SOZ is highly correlated with ANREDE_KZ | High correlation |
VERS_TYP is highly correlated with HEALTH_TYP | High correlation |
ANREDE_KZ is highly correlated with SEMIO_SOZ | High correlation |
NATIONALITAET_KZ is highly correlated with VERS_TYP and 3 other fields | High correlation |
ALTERSKATEGORIE_GROB is highly correlated with AGER_TYP and 3 other fields | High correlation |
AGER_TYP is highly correlated with ALTERSKATEGORIE_GROB and 1 other fields | High correlation |
RETOURTYP_BK_S is highly correlated with ALTERSKATEGORIE_GROB and 1 other fields | High correlation |
VERS_TYP is highly correlated with NATIONALITAET_KZ and 4 other fields | High correlation |
GEBURTSJAHR is highly correlated with PRAEGENDE_JUGENDJAHRE | High correlation |
ZABEOTYP is highly correlated with GREEN_AVANTGARDE and 1 other fields | High correlation |
GREEN_AVANTGARDE is highly correlated with ZABEOTYP and 1 other fields | High correlation |
HEALTH_TYP is highly correlated with NATIONALITAET_KZ and 3 other fields | High correlation |
PRAEGENDE_JUGENDJAHRE is highly correlated with NATIONALITAET_KZ and 9 other fields | High correlation |
ANREDE_KZ is highly correlated with SEMIO_SOZ | High correlation |
SEMIO_SOZ is highly correlated with ANREDE_KZ | High correlation |
SHOPPER_TYP is highly correlated with NATIONALITAET_KZ and 4 other fields | High correlation |
CJT_GESAMTTYP is highly correlated with VERS_TYP | High correlation |
NATIONALITAET_KZ is highly correlated with HEALTH_TYP and 2 other fields | High correlation |
HEALTH_TYP is highly correlated with NATIONALITAET_KZ and 2 other fields | High correlation |
VERS_TYP is highly correlated with NATIONALITAET_KZ and 2 other fields | High correlation |
SHOPPER_TYP is highly correlated with NATIONALITAET_KZ and 2 other fields | High correlation |
SOHO_KZ has 73499 (8.2%) missing values | Missing |
TITEL_KZ has 73499 (8.2%) missing values | Missing |
TITEL_KZ is highly skewed (γ1 = 39.64777145) | Skewed |
GEBURTSJAHR has 392318 (44.0%) zeros | Zeros |
PRAEGENDE_JUGENDJAHRE has 108164 (12.1%) zeros | Zeros |
TITEL_KZ has 815562 (91.5%) zeros | Zeros |
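A large share of zeros in a year-valued column such as GEBURTSJAHR can distort summary statistics. The sketch below uses a small hypothetical sample (not rows from the dataset) to show how the median shifts once zeros are excluded. Treating 0 as an "unknown" marker is an assumption here; the report itself does not say what 0 encodes.

```python
import statistics

# Hypothetical toy sample of birth years; 0 is ASSUMED to mean "unknown".
birth_years = [0, 0, 1943, 1965, 1967, 0, 1970, 1980, 0, 1990]

# Keep only the entries that are plausible years under that assumption.
valid_years = [year for year in birth_years if year != 0]

zero_share_pct = 100 * (len(birth_years) - len(valid_years)) / len(birth_years)
median_with_zeros = statistics.median(birth_years)
median_without_zeros = statistics.median(valid_years)

print(f"Zeros: {zero_share_pct:.1f}%")
print(f"Median with zeros:    {median_with_zeros}")
print(f"Median without zeros: {median_without_zeros}")
```

Whether zeros should be dropped, imputed, or kept depends on the data dictionary for the column, which is not included in the report.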
Reproduction
| Analysis started | 2021-05-17 19:50:27.286332 |
|---|---|
| Analysis finished | 2021-05-17 19:53:06.546268 |
| Duration | 2 minutes and 39.26 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
AGER_TYP
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.9 MiB |
| -1 | |
|---|---|
| 2 | |
| 1 | |
| 3 | 27104 |
| 0 | 8340 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.760196405 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1568724 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | -1 |
| 3rd row | -1 |
| 4th row | 2 |
| 5th row | -1 |
Common Values
| Value | Count | Frequency (%) |
| -1 | 677503 | |
| 2 | 98472 | 11.0% |
| 1 | 79802 | 9.0% |
| 3 | 27104 | 3.0% |
| 0 | 8340 | 0.9% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 757305 | |
| 2 | 98472 | 11.0% |
| 3 | 27104 | 3.0% |
| 0 | 8340 | 0.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 757305 | |
| - | 677503 | |
| 2 | 98472 | 6.3% |
| 3 | 27104 | 1.7% |
| 0 | 8340 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 | |
| Dash Punctuation | 677503 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 757305 | |
| 2 | 98472 | 11.0% |
| 3 | 27104 | 3.0% |
| 0 | 8340 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 677503 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1568724 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 757305 | |
| - | 677503 | |
| 2 | 98472 | 6.3% |
| 3 | 27104 | 1.7% |
| 0 | 8340 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1568724 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 757305 | |
| - | 677503 | |
| 2 | 98472 | 6.3% |
| 3 | 27104 | 1.7% |
| 0 | 8340 | 0.5% |
CJT_GESAMTTYP
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4854 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.632838316 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.595021092 |
|---|---|
| Coefficient of variation (CV) | 0.4390564493 |
| Kurtosis | -1.068626824 |
| Mean | 3.632838316 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.03888466953 |
| Sum | 3220028 |
| Variance | 2.544092284 |
| Monotonicity | Not monotonic |
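Several of the descriptive statistics above are tied together by simple identities: the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, the interquartile range is Q3 minus Q1, and the range is maximum minus minimum. The sketch below checks these identities against the reported figures; the variable names are illustrative only.

```python
import math

# Figures copied from the tables above.
mean = 3.632838316
std = 1.595021092
q1, q3 = 2, 5
minimum, maximum = 1, 6

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range

# Compare against the values printed in the report.
print(math.isclose(cv, 0.4390564493, rel_tol=1e-6))
print(math.isclose(variance, 2.544092284, rel_tol=1e-6))
print(iqr, value_range)
```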
| Value | Count | Frequency (%) |
| 4 | 210963 | |
| 3 | 156449 | |
| 6 | 153915 | |
| 2 | 148795 | |
| 5 | 117376 | |
| 1 | 98869 | |
| (Missing) | 4854 | 0.5% |
| Value | Count | Frequency (%) |
| 1 | 98869 | |
| 2 | 148795 | |
| 3 | 156449 | |
| 4 | 210963 | |
| 5 | 117376 | |
| 6 | 153915 |
| Value | Count | Frequency (%) |
| 6 | 153915 | |
| 5 | 117376 | |
| 4 | 210963 | |
| 3 | 156449 | |
| 2 | 148795 | |
| 1 | 98869 |
GEBURTSJAHR
Real number (ℝ≥0)
| Distinct | 117 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1101.178533 |
| Minimum | 0 |
|---|---|
| Maximum | 2017 |
| Zeros | 392318 |
| Zeros (%) | 44.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1943 |
| Q3 | 1970 |
| 95-th percentile | 1990 |
| Maximum | 2017 |
| Range | 2017 |
| Interquartile range (IQR) | 1970 |
Descriptive statistics
| Standard deviation | 976.5835513 |
|---|---|
| Coefficient of variation (CV) | 0.88685306 |
| Kurtosis | -1.941480575 |
| Mean | 1101.178533 |
| Median Absolute Deviation (MAD) | 46 |
| Skewness | -0.240357039 |
| Sum | 981393433 |
| Variance | 953715.4326 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 392318 | |
| 1967 | 11183 | 1.3% |
| 1965 | 11090 | 1.2% |
| 1966 | 10933 | 1.2% |
| 1970 | 10883 | 1.2% |
| 1964 | 10799 | 1.2% |
| 1968 | 10792 | 1.2% |
| 1963 | 10513 | 1.2% |
| 1969 | 10360 | 1.2% |
| 1980 | 10275 | 1.2% |
| Other values (107) | 402075 |
| Value | Count | Frequency (%) |
| 0 | 392318 | |
| 1900 | 4 | < 0.1% |
| 1902 | 1 | < 0.1% |
| 1904 | 5 | < 0.1% |
| 1905 | 8 | < 0.1% |
| 1906 | 7 | < 0.1% |
| 1907 | 4 | < 0.1% |
| 1908 | 7 | < 0.1% |
| 1909 | 7 | < 0.1% |
| 1910 | 41 | < 0.1% |
| Value | Count | Frequency (%) |
| 2017 | 593 | |
| 2016 | 167 | < 0.1% |
| 2015 | 257 | < 0.1% |
| 2014 | 124 | < 0.1% |
| 2013 | 380 | |
| 2012 | 806 | |
| 2011 | 485 | |
| 2010 | 545 | |
| 2009 | 559 | |
| 2008 | 550 |
GFK_URLAUBERTYP
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4854 |
| Missing (%) | 0.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.350304107 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.525723215 |
|---|---|
| Coefficient of variation (CV) | 0.4796703869 |
| Kurtosis | -1.23285991 |
| Mean | 7.350304107 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.2416175217 |
| Sum | 6515067 |
| Variance | 12.43072419 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 138545 | |
| 5 | 120126 | |
| 10 | 109127 | |
| 8 | 88042 | |
| 11 | 79740 | |
| 4 | 63770 | |
| 9 | 60614 | |
| 3 | 56007 | |
| 1 | 53600 | 6.0% |
| 2 | 46702 | 5.2% |
| Other values (2) | 70094 |
| Value | Count | Frequency (%) |
| 1 | 53600 | |
| 2 | 46702 | 5.2% |
| 3 | 56007 | |
| 4 | 63770 | |
| 5 | 120126 | |
| 6 | 27138 | 3.0% |
| 7 | 42956 | 4.8% |
| 8 | 88042 | |
| 9 | 60614 | |
| 10 | 109127 |
| Value | Count | Frequency (%) |
| 12 | 138545 | |
| 11 | 79740 | |
| 10 | 109127 | |
| 9 | 60614 | |
| 8 | 88042 | |
| 7 | 42956 | 4.8% |
| 6 | 27138 | 3.0% |
| 5 | 120126 | |
| 4 | 63770 | |
| 3 | 56007 |
GREEN_AVANTGARDE
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.3 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 891221 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 891221 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 891221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 715996 | |
| 1 | 175225 | 19.7% |
HEALTH_TYP
Categorical
HIGH CORRELATION ×5
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.4 MiB |
| 3 | |
|---|---|
| 2 | |
| 1 | |
| -1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.124768155 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1002417 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 162388 | |
| -1 | 111196 | 12.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 273584 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 273584 | |
| - | 111196 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 | |
| Dash Punctuation | 111196 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 273584 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 111196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1002417 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 273584 | |
| - | 111196 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1002417 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 310693 | |
| 2 | 306944 | |
| 1 | 273584 | |
| - | 111196 | 11.1% |
NATIONALITAET_KZ
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.3 MiB |
| 1 | |
|---|---|
| 0 | |
| 2 | 65418 |
| 3 | 33403 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 891221 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 891221 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 891221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 684085 | |
| 0 | 108315 | 12.2% |
| 2 | 65418 | 7.3% |
| 3 | 33403 | 3.7% |
PRAEGENDE_JUGENDJAHRE
Real number (ℝ≥0)
HIGH CORRELATION ×4, ZEROS
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.154345555 |
| Minimum | 0 |
|---|---|
| Maximum | 15 |
| Zeros | 108164 |
| Zeros (%) | 12.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 8 |
| Q3 | 14 |
| 95-th percentile | 14 |
| Maximum | 15 |
| Range | 15 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 4.844532197 |
|---|---|
| Coefficient of variation (CV) | 0.5941043538 |
| Kurtosis | -1.11662633 |
| Mean | 8.154345555 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.2507805175 |
| Sum | 7267324 |
| Variance | 23.4694922 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 188697 | |
| 8 | 145988 | |
| 0 | 108164 | |
| 5 | 86416 | |
| 10 | 85808 | |
| 3 | 55195 | 6.2% |
| 15 | 42547 | 4.8% |
| 11 | 35752 | 4.0% |
| 9 | 33570 | 3.8% |
| 6 | 25652 | 2.9% |
| Other values (6) | 83432 |
| Value | Count | Frequency (%) |
| 0 | 108164 | |
| 1 | 21282 | 2.4% |
| 2 | 7479 | 0.8% |
| 3 | 55195 | 6.2% |
| 4 | 20451 | 2.3% |
| 5 | 86416 | |
| 6 | 25652 | 2.9% |
| 7 | 4010 | 0.4% |
| 8 | 145988 | |
| 9 | 33570 | 3.8% |
| Value | Count | Frequency (%) |
| 15 | 42547 | 4.8% |
| 14 | 188697 | |
| 13 | 5764 | 0.6% |
| 12 | 24446 | 2.7% |
| 11 | 35752 | 4.0% |
| 10 | 85808 | |
| 9 | 33570 | 3.8% |
| 8 | 145988 | |
| 7 | 4010 | 0.4% |
| 6 | 25652 | 2.9% |
RETOURTYP_BK_S
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4854 |
| Missing (%) | 0.5% |
| Memory size | 50.9 MiB |
| 5.0 | |
|---|---|
| 3.0 | |
| 4.0 | |
| 1.0 | |
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2659101 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 5.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 3.0 |
| 4th row | 2.0 |
| 5th row | 5.0 |
Common Values
| Value | Count | Frequency (%) |
| 5.0 | 297993 | |
| 3.0 | 231816 | |
| 4.0 | 131115 | |
| 1.0 | 129712 | |
| 2.0 | 95731 | 10.7% |
| (Missing) | 4854 | 0.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 5.0 | 297993 | |
| 3.0 | 231816 | |
| 4.0 | 131115 | |
| 1.0 | 129712 | |
| 2.0 | 95731 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 886367 | |
| 0 | 886367 | |
| 5 | 297993 | 11.2% |
| 3 | 231816 | 8.7% |
| 4 | 131115 | 4.9% |
| 1 | 129712 | 4.9% |
| 2 | 95731 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1772734 | |
| Other Punctuation | 886367 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 886367 | |
| 5 | 297993 | 16.8% |
| 3 | 231816 | 13.1% |
| 4 | 131115 | 7.4% |
| 1 | 129712 | 7.3% |
| 2 | 95731 | 5.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 886367 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2659101 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 886367 | |
| 0 | 886367 | |
| 5 | 297993 | 11.2% |
| 3 | 231816 | 8.7% |
| 4 | 131115 | 4.9% |
| 1 | 129712 | 4.9% |
| 2 | 95731 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2659101 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 886367 | |
| 0 | 886367 | |
| 5 | 297993 | 11.2% |
| 3 | 231816 | 8.7% |
| 4 | 131115 | 4.9% |
| 1 | 129712 | 4.9% |
| 2 | 95731 | 3.6% |
SEMIO_SOZ
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.945859669 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.946564233 |
|---|---|
| Coefficient of variation (CV) | 0.4933181603 |
| Kurtosis | -1.353534476 |
| Mean | 3.945859669 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1789455842 |
| Sum | 3516633 |
| Variance | 3.789112312 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 244714 | |
| 6 | 136205 | |
| 5 | 121786 | |
| 3 | 118889 | |
| 7 | 117378 | |
| 4 | 90161 | 10.1% |
| 1 | 62088 | 7.0% |
| Value | Count | Frequency (%) |
| 1 | 62088 | 7.0% |
| 2 | 244714 | |
| 3 | 118889 | |
| 4 | 90161 | 10.1% |
| 5 | 121786 | |
| 6 | 136205 | |
| 7 | 117378 |
| Value | Count | Frequency (%) |
| 7 | 117378 | |
| 6 | 136205 | |
| 5 | 121786 | |
| 4 | 90161 | 10.1% |
| 3 | 118889 | |
| 2 | 244714 | |
| 1 | 62088 | 7.0% |
SHOPPER_TYP
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.4 MiB |
| 1 | |
|---|---|
| 2 | |
| 3 | |
| 0 | |
| -1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.124768155 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1002417 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | 3 |
| 3rd row | 2 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 254761 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | |
| -1 | 111196 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 365957 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 365957 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | 12.7% |
| - | 111196 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 | |
| Dash Punctuation | 111196 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 365957 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | 14.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 111196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1002417 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 365957 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | 12.7% |
| - | 111196 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1002417 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 365957 | |
| 2 | 207463 | |
| 3 | 190219 | |
| 0 | 127582 | 12.7% |
| - | 111196 | 11.1% |
SOHO_KZ
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73499 |
| Missing (%) | 8.2% |
| Memory size | 49.6 MiB |
| 0.0 | |
|---|---|
| 1.0 | 6888 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2453166 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 810834 | |
| 1.0 | 6888 | 0.8% |
| (Missing) | 73499 | 8.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0.0 | 810834 | |
| 1.0 | 6888 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1628556 | |
| . | 817722 | |
| 1 | 6888 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1635444 | |
| Other Punctuation | 817722 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1628556 | |
| 1 | 6888 | 0.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 817722 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2453166 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1628556 | |
| . | 817722 | |
| 1 | 6888 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2453166 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1628556 | |
| . | 817722 | |
| 1 | 6888 | 0.3% |
TITEL_KZ
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73499 |
| Missing (%) | 8.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.003482846248 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 815562 |
| Zeros (%) | 91.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.08495716307 |
|---|---|
| Coefficient of variation (CV) | 24.39302714 |
| Kurtosis | 1998.880458 |
| Mean | 0.003482846248 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 39.64777145 |
| Sum | 2848 |
| Variance | 0.007217719557 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 815562 | |
| 1 | 1947 | 0.2% |
| 5 | 104 | < 0.1% |
| 4 | 57 | < 0.1% |
| 3 | 49 | < 0.1% |
| 2 | 3 | < 0.1% |
| (Missing) | 73499 | 8.2% |
| Value | Count | Frequency (%) |
| 0 | 815562 | |
| 1 | 1947 | 0.2% |
| 2 | 3 | < 0.1% |
| 3 | 49 | < 0.1% |
| 4 | 57 | < 0.1% |
| 5 | 104 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 104 | < 0.1% |
| 4 | 57 | < 0.1% |
| 3 | 49 | < 0.1% |
| 2 | 3 | < 0.1% |
| 1 | 1947 | 0.2% |
| 0 | 815562 |
VERS_TYP
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.4 MiB |
| 2 | |
|---|---|
| 1 | |
| -1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.124768155 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1002417 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | -1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 398722 | |
| 1 | 381303 | |
| -1 | 111196 | 12.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 1 | 492499 | |
| 2 | 398722 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 492499 | |
| 2 | 398722 | |
| - | 111196 | 11.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 | |
| Dash Punctuation | 111196 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 492499 | |
| 2 | 398722 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 111196 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1002417 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 492499 | |
| 2 | 398722 | |
| - | 111196 | 11.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1002417 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 492499 | |
| 2 | 398722 | |
| - | 111196 | 11.1% |
ZABEOTYP
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3624376 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 6.8 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.352704299 |
|---|---|
| Coefficient of variation (CV) | 0.4022987071 |
| Kurtosis | -0.2449129835 |
| Mean | 3.3624376 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.02846326419 |
| Sum | 2996675 |
| Variance | 1.82980892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 364905 | |
| 4 | 210095 | |
| 1 | 123622 | 13.9% |
| 5 | 84956 | 9.5% |
| 6 | 74473 | 8.4% |
| 2 | 33170 | 3.7% |
| Value | Count | Frequency (%) |
| 1 | 123622 | 13.9% |
| 2 | 33170 | 3.7% |
| 3 | 364905 | |
| 4 | 210095 | |
| 5 | 84956 | 9.5% |
| 6 | 74473 | 8.4% |
| Value | Count | Frequency (%) |
| 6 | 74473 | 8.4% |
| 5 | 84956 | 9.5% |
| 4 | 210095 | |
| 3 | 364905 | |
| 2 | 33170 | 3.7% |
| 1 | 123622 | 13.9% |
ANREDE_KZ
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.3 MiB |
| 2 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 891221 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 891221 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 891221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 465305 | |
| 1 | 425916 |
ALTERSKATEGORIE_GROB
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 49.3 MiB |
| 3 | |
|---|---|
| 4 | |
| 2 | |
| 1 | |
| 9 | 2881 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 891221 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 1 |
| 3rd row | 3 |
| 4th row | 4 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 891221 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 891221 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 891221 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 358533 | |
| 4 | 228510 | |
| 2 | 158410 | |
| 1 | 142887 | 16.0% |
| 9 | 2881 | 0.3% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the steepness of the line does not affect r (only the sign of the slope does). To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
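The calculation described above can be sketched in plain Python. This is an illustrative implementation, not the code the report generator uses; the function name `pearson_r` is chosen here for the example.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator,
    # so plain sums of products and squares are sufficient.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)

x = [1, 2, 3, 4]
print(pearson_r(x, [2, 4, 6, 8]))  # perfectly increasing linear relationship
print(pearson_r(x, [8, 6, 4, 2]))  # perfectly decreasing linear relationship
```

Shifting or rescaling either input by a positive factor, for example replacing each `a` in `x` with `10 * a + 3`, leaves the result unchanged, which is the location and scale invariance mentioned above. A constant input has zero variance and raises `ZeroDivisionError` in this sketch.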
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better than Pearson's r at catching nonlinear monotonic correlations. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of the standard deviations of those rank variables.
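The rank-based calculation can be sketched in pure Python as follows. This is a minimal illustration, assuming ties receive the average of the ranks they span; the helper names `average_ranks` and `spearman_rho` are hypothetical and not taken from the report.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of the ranks they occupy."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Illustrative data: y = x**3 is monotonic but not linear in x.
x = [1, 2, 3, 4, 5]
y = [1, 8, 27, 64, 125]

rho = spearman_rho(x, y)   # monotonic relationship, so rho is 1
r = pearson_r(x, y)        # strictly below 1 because the relation is nonlinear
```

On this example ρ reaches 1 while Pearson's r stays below 1, which is the difference the paragraph above describes.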
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
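A minimal sketch of the pair-counting definition above, in pure Python. It implements the simple variant often called tau-a, in which tied pairs count as neither concordant nor discordant; statistical libraries commonly apply a tie correction (tau-b), so results can differ when ties are present. The function name `kendall_tau_a` and the sample data are illustrative assumptions.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:      # both variables move in the same direction
            concordant += 1
        elif product < 0:    # the variables move in opposite directions
            discordant += 1
        # product == 0 means a tie in x or y: counted in neither group
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs

# Illustrative data: 6 pairs in total, 5 concordant and 1 discordant.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau_a(x, y)  # (5 - 1) / 6
```

The double loop over pairs is quadratic in the number of observations, which is fine for a sketch but slow on a dataset of this size.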
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | AGER_TYP | CJT_GESAMTTYP | GEBURTSJAHR | GFK_URLAUBERTYP | GREEN_AVANTGARDE | HEALTH_TYP | NATIONALITAET_KZ | PRAEGENDE_JUGENDJAHRE | RETOURTYP_BK_S | SEMIO_SOZ | SHOPPER_TYP | SOHO_KZ | TITEL_KZ | VERS_TYP | ZABEOTYP | ANREDE_KZ | ALTERSKATEGORIE_GROB |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | -1 | 2.0 | 0 | 10.0 | 0 | -1 | 0 | 0 | 5.0 | 2 | -1 | NaN | NaN | -1 | 3 | 1 | 2 |
| 1 | -1 | 5.0 | 1996 | 10.0 | 0 | 3 | 1 | 14 | 1.0 | 5 | 3 | 1.0 | 0.0 | 2 | 5 | 2 | 1 |
| 2 | -1 | 3.0 | 1979 | 10.0 | 1 | 3 | 1 | 15 | 3.0 | 4 | 2 | 0.0 | 0.0 | 1 | 5 | 2 | 3 |
| 3 | 2 | 2.0 | 1957 | 1.0 | 0 | 2 | 1 | 8 | 2.0 | 5 | 1 | 0.0 | 0.0 | 1 | 3 | 2 | 4 |
| 4 | -1 | 5.0 | 1963 | 5.0 | 0 | 3 | 1 | 8 | 5.0 | 6 | 2 | 0.0 | 0.0 | 2 | 4 | 1 | 3 |
| 5 | 3 | 2.0 | 1943 | 1.0 | 0 | 3 | 1 | 3 | 3.0 | 2 | 0 | 0.0 | 0.0 | 2 | 4 | 2 | 1 |
| 6 | -1 | 5.0 | 0 | 12.0 | 0 | 2 | 1 | 10 | 4.0 | 2 | 1 | 0.0 | 0.0 | 1 | 4 | 2 | 2 |
| 7 | -1 | 3.0 | 1964 | 9.0 | 0 | 1 | 1 | 8 | 5.0 | 7 | 0 | 0.0 | 0.0 | 1 | 1 | 1 | 1 |
| 8 | -1 | 3.0 | 1974 | 3.0 | 1 | 3 | 1 | 11 | 4.0 | 4 | 3 | 0.0 | 0.0 | 2 | 6 | 1 | 3 |
| 9 | -1 | 4.0 | 1975 | 12.0 | 1 | 2 | 1 | 15 | 4.0 | 2 | 3 | 0.0 | 0.0 | 2 | 4 | 2 | 3 |
Last rows
| | AGER_TYP | CJT_GESAMTTYP | GEBURTSJAHR | GFK_URLAUBERTYP | GREEN_AVANTGARDE | HEALTH_TYP | NATIONALITAET_KZ | PRAEGENDE_JUGENDJAHRE | RETOURTYP_BK_S | SEMIO_SOZ | SHOPPER_TYP | SOHO_KZ | TITEL_KZ | VERS_TYP | ZABEOTYP | ANREDE_KZ | ALTERSKATEGORIE_GROB |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 891211 | -1 | 2.0 | 1963 | 1.0 | 0 | 3 | 1 | 8 | 5.0 | 4 | 1 | 0.0 | 0.0 | 1 | 6 | 1 | 3 |
| 891212 | -1 | 1.0 | 0 | 4.0 | 0 | 1 | 1 | 3 | 5.0 | 6 | 2 | 0.0 | 0.0 | 1 | 3 | 1 | 4 |
| 891213 | -1 | 5.0 | 1966 | 8.0 | 1 | 1 | 1 | 11 | 2.0 | 2 | 1 | 0.0 | 0.0 | 1 | 4 | 2 | 4 |
| 891214 | -1 | 4.0 | 1978 | 10.0 | 0 | 3 | 1 | 14 | 4.0 | 5 | 3 | 0.0 | 0.0 | 2 | 5 | 2 | 1 |
| 891215 | -1 | 6.0 | 0 | 12.0 | 0 | 2 | 2 | 10 | 1.0 | 2 | 1 | 0.0 | 0.0 | 1 | 4 | 2 | 2 |
| 891216 | -1 | 5.0 | 1976 | 12.0 | 0 | 3 | 1 | 14 | 3.0 | 2 | 3 | 0.0 | 0.0 | 1 | 4 | 2 | 3 |
| 891217 | -1 | 4.0 | 1970 | 1.0 | 0 | -1 | 0 | 10 | 5.0 | 4 | -1 | 0.0 | 0.0 | -1 | 6 | 1 | 2 |
| 891218 | -1 | 4.0 | 1976 | 10.0 | 0 | 1 | 1 | 14 | 4.0 | 5 | 2 | 0.0 | 0.0 | 1 | 4 | 2 | 2 |
| 891219 | -1 | 3.0 | 1994 | 9.0 | 0 | 1 | 1 | 14 | 4.0 | 7 | 0 | 0.0 | 0.0 | 2 | 5 | 1 | 1 |
| 891220 | -1 | 1.0 | 0 | 12.0 | 0 | 2 | 1 | 3 | 1.0 | 6 | 2 | 0.0 | 0.0 | 1 | 3 | 1 | 4 |
Most frequently occurring
| | AGER_TYP | CJT_GESAMTTYP | GEBURTSJAHR | GFK_URLAUBERTYP | GREEN_AVANTGARDE | HEALTH_TYP | NATIONALITAET_KZ | PRAEGENDE_JUGENDJAHRE | RETOURTYP_BK_S | SEMIO_SOZ | SHOPPER_TYP | SOHO_KZ | TITEL_KZ | VERS_TYP | ZABEOTYP | ANREDE_KZ | ALTERSKATEGORIE_GROB | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 51132 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 4 | -1 | 0.0 | 0.0 | -1 | 4 | 1 | 3 | 467 |
| 51121 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 2 | -1 | 0.0 | 0.0 | -1 | 3 | 2 | 3 | 441 |
| 51140 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 5 | -1 | 0.0 | 0.0 | -1 | 3 | 2 | 3 | 389 |
| 51130 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 4 | -1 | 0.0 | 0.0 | -1 | 3 | 1 | 3 | 260 |
| 50166 | -1 | 5.0 | 0 | 10.0 | 0 | -1 | 0 | 0 | 5.0 | 4 | -1 | 0.0 | 0.0 | -1 | 4 | 1 | 3 | 251 |
| 51142 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 5 | -1 | 0.0 | 0.0 | -1 | 4 | 2 | 3 | 236 |
| 50154 | -1 | 5.0 | 0 | 10.0 | 0 | -1 | 0 | 0 | 5.0 | 2 | -1 | 0.0 | 0.0 | -1 | 3 | 2 | 3 | 223 |
| 51115 | -1 | 5.0 | 0 | 12.0 | 0 | -1 | 0 | 0 | 5.0 | 1 | -1 | 0.0 | 0.0 | -1 | 3 | 2 | 3 | 194 |
| 50174 | -1 | 5.0 | 0 | 10.0 | 0 | -1 | 0 | 0 | 5.0 | 5 | -1 | 0.0 | 0.0 | -1 | 3 | 2 | 3 | 191 |
| 34942 | -1 | 4.0 | 0 | 12.0 | 0 | 3 | 1 | 8 | 5.0 | 6 | 1 | 0.0 | 0.0 | 2 | 3 | 1 | 3 | 189 |
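A table like the one above can be approximated by counting identical rows and keeping those that occur more than once. The sketch below assumes that "# duplicates" is the number of times an identical row appears; it is not necessarily how the report computes the column. The four-column rows are made-up stand-ins for the 17-variable records.

```python
from collections import Counter

# Hypothetical rows (tuples of column values), not taken from the dataset.
rows = [
    (-1, 5.0, 0, 12.0),
    (-1, 5.0, 0, 12.0),
    (-1, 5.0, 0, 10.0),
    (2, 2.0, 1957, 1.0),
    (-1, 5.0, 0, 12.0),
    (-1, 5.0, 0, 10.0),
]

# Count how often each distinct row occurs.
row_counts = Counter(rows)

# Keep only rows that occur more than once, most frequent first.
most_frequent_duplicates = [
    (row, n) for row, n in row_counts.most_common() if n > 1
]

for row, n in most_frequent_duplicates:
    print(row, n)
```

Tuples are used because they are hashable. Rows containing NaN would need special handling, since NaN does not compare equal to itself.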